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PROJECT  TITLE:  SYSTEMS  BIOLOGY  OF  DEHALOCOCCOIDES:  USING  NETWORK 
INFERENCE  MODELING  TO  INTEGRATE  OMICS  DATASETS  UNDER  VARIED 
CONDITIONS 

DoD  Award:  W91  INF-07- 1-0249 
ABSTRACT 

The  overall  aims  of  this  project  were  to  1.  obtain  a  systems-biology  level  understanding  of  gene 
networks  in  the  Dehalococcoides  and  2.  develop  assays  for  quantitative  biomarkers  of 
chloroethene  detoxification  steps  and  rates  that  could  be  deployed  at  sites  undergoing  in  situ 
bioremediation  utilizing  this  important  group  of  microbes.  After  collecting  genome-wide 
microarray  expression  data  along  with  metabolite  and  chloroethene  data,  under  a  wide  range  of 
conditions  (n  =  53),  we  employed  Bayesian  inference  algorithms  to  reconstruct  the  gene-gene 
and  gene -metabolite  subnetworks  that  are  most  supported  by  the  expression  data.  These 
networks  then  led  to  the  discovery  of  robust  biomarker  candidates  of  respiration  (key 
dehalogenases  and  the  core  hydrogenase)  as  well  as  of  stress  (e.g  chaperones,  heat  shock 
proteins,  superoxide  dismutase).  Quantitative  assays  for  these  biomarkers  were  then  developed 
and  applied  to  examine  the  power  of  protein  and/or  RNA  biomarker  levels  to  serve  as  estimators 
of  bulk  culture  respiration  rates.  Collectively  the  work  is  the  most  comprehensive  picture  of 
Dehalococcoides’  systems  biology  yet  available.  On  the  applied  side,  RNA  biomarkers  were 
responsive  to  respiration  rate  down  to  0.0 lx  the  populations’  maximum  -  thereby  demonstrating 
a  wide  range  over  which  they  are  infonnative.  These  RNA-respiration  trends  were  also  tested  in 
an  additional  strain  of  Dehalococcoides  (KB1  -  a  commercially  sold  bioaugmentation  culture 
that  has  been  deployed  at  dozens  of  sites  across  the  country)  and  results  for  RNA  from  the 
hydrogenase  enzyme  “Hup”  were  remarkable  consistent  across  the  organisms  in  trend  and 
absolute  abundance  (in  transcripts  per  cell).  Application  of  the  mass  spectrometry-based  protein 
biomarker  quantification  method  we  developed  enabled  calculation  of  enzyme  kinetic  rate 
parameters  in  vivo  for  two  key  reductive  dehalogenase  enzymes:  TCE-Reductive  dehalogenase 
and  PCE-Reductive  dehalogenase. 

Statement  of  the  problem  studied 

Organisms  in  the  group  “ Dehalococcoides ”  are  remarkable  for  their  ability  to  respire  a  wide 
range  of  chlorinated  organic  compounds.  Application  of  DHC  physiology  at  field  sites  has 


grown  widespread  for  the  treatment  of  groundwater  chlorinated  solvent  plumes  (PCE  and  TCE) 
and  there  is  growing  interest  in  extending  application  to  bioremediation  of  PCB,  chlorinated 
benzenes,  and  chlorinated  phenols.  However,  in  situ  bioremediation  requires  effective 
documentation  to  government  agencies.  mRN A/protein  biomarkers  that  are  specific  to 
detoxification  processes  are  being  explored  for  exactly  this  purpose.  Additionally,  at  sites  where 
remediation  has  stalled  or  slowed  to  unacceptable  rates,  it  would  be  useful  to  monitor  stress 
response  biomarkers  to  aid  site  managers  in  appropriate  corrective  action. 

Another  major  hurdle  toward  better  understanding  of  the  Dehalococcoides’  biology  is  the  fact 
that  no  genetic  system  has  yet  succeeded  -  thereby  precluding  traditional  strategies  for  functional 
annotation  of  the  organisms’  -1600  genes.  Another  strategy  must  therefore  be  employed  to 
suggest  gene  function.  In  this  work  we  took  the  approach  of  combining  g  genome- wide  gene 
expression  profiling  with  Bayesian  inference  algorithms  to  generate  data-driven  models  of  gene 
networks  and  gene -metabolite  interactions. 


Summary  of  the  most  important  results 

Detailed  results  for  this  work  are  provided  in  the  mansucripts,  technical  reports,  and  PhD 
dissertations  uploaded  as  part  of  this  final  report.  Here  we  quickly  touch  upon  some  of  the  key 
findings  vis-a-vis  questions  posed  in  the  original  proposal.  These  questions,  listed  below,  are 
addressed  in  the  subsequent  paragraphs. 

1.  Are  biomarkers  expressed  in  other  cultured/field  strains  of 

Dehalococcoides ?  How  do  candidate  biomarker  levels  correlate  with  respiration  rate  over  a 
wider  range  of  growth  conditions? 

2.  What  networks  of  RDases  emerge  in  cultures  grown  on  different  substrates? 

Are  there  specific  transcriptional  regulators  with  expression  tied  to 
individual  or  groups  of  RDases? 

3.  Are  individual  RDases  co-regulated  with  other  elements  of  the 
proposed  electron  transport  chain  (e.g  Hup)? 

4.  Which  gene  networks  correlate  with  the  presence  of  other  community  members? 

5.  Which  biomarkers  are  indicative  of  DHC  stress? 


1.  Are  biomarkers  expressed  in  other  cultured/field  strains  of 
Dehalococcoides?  How  do  candidate  biomarker  levels  correlate  with  respiration  rate  over  a 
wider  range  of  growth  conditions? 


Shotgun  proteomics  were  conducted  in  an  additional  three  Dehalococcoides-containing 
cultures  from  three  different  labs  around  the  world.  In  all  cases  the  same  key  hydrogenase  was 
confirmed  at  the  protein  level  (“Hup”),  while  reductive  dehalogenases  varied  from  culture  to 
culture.  Quantitative  reverse  transcription  PCR  (qRTPCR)  in  two  cultures  showed  remarkable 
similarity  in  both  the  trends  between  mRNA  levels  and  respiration  rates  as  well  as  the  absolute 
levels  (transcripts  per  cell).  The  vast  majority  of  this  work  is  presented  in  an  uploaded 
manuscript  (Rowe  et  al,  title:  “Relating  Chloroethene  Respiration  in  Dehalococcoides  to  Protein 
and  mRNA  Biomarkers:  Implications  for  Protein  Production  and  Enzyme-Specific  Rate 
Parameters”)  and  the  comparison  between  the  two  cultures  in  the  most  promising  respiration 
biomarker  (Hup)  is  given  in  the  following  figures  {Dehalococcoides  strain  195  in  the  left  panel 
and  Dehalococcoides  KB1  In  the  right  panel).  In  both  cases,  the  trends  are  linear  on  the  log-log 
plots  (power  law  relationship)  over  two  orders  of  magnitude  of  per-cell  respiration  rates. 
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2.  What  networks  of  RDases  emerge  in  cultures  grown  on  different  substrates? 

Are  there  specific  transcriptional  regulators  with  expression  tied  to 
individual  or  groups  of  RDases? 

The  complete  dataset  altered  both  the  electron  donor  and  acceptor  rate  and  type  fed  to  the  Donna 
II  mixed  community  culture  at  a  continuous  rate  as  mentioned  previously.  Within  the  chlorinated 
ethene  family,  the  previously  monitored  reductive  dehalogenases  of  tceA  (DET0079),  pceA 
(DET0318),  DET1559,  and  DET1545  were  detected  -but  none  were  specific  to  a  particular 
compound,  rather  they  were  expressed  whenever  any  chlorinated  ethene  was  ed.  Additionally, 
the  reductive  dehalogenases  DET0180  and  DET0174  displayed  statistically  confident 


representation.  DET0162,  the  reductive  dehalogenase  with  the  known  and  reported  point 
mutation,  also  displayed  a  significant  intensity.  The  remaining  RDases  did  not  distinguish 
themselves  from  the  background  intensity  of  the  microarrays  preformed  when  average  intensities 
across  all  53  experiments  were  averaged.  However,  in  individual  experiments  some  of  the 
“minor”  RDases  show  strong  up  regulation  well  above  background  intensity  (e.g.  DET1 171’s 
specific  response  to  addition  of  chlorphenols  as  presented  in  the  manuscript  by  Mansfeldt  et 
al.2012;  NOTE:  as  this  manscript  has  not  yet  been  submitted  is  was  listed  as  a  Technical  Report). 
DET1 171’s  very  specific  expression  when  grown  on  chlorophenols  suggest  it  has  a  unique 
function  against  this  compound  class. 

In  the  Gene  network  analysis,  several  distinct  networks  emerge  with  the  RDases  mentioned 
above.  The  transcript  for  tceA  (DET0079)  resides  in  a  gene  network  (Figure  #)  with  some  other 
expressed  transcripts  such  as  the  putative  S-layer  Protein  DET1407  and  a  putative  transcriptional 
regulator  (DET01 17). 


The  DET1559  reductive  dehalogenase  transcript,  its  associated  anchoring  protein  (DET1558), 
and  its  histidine-kinase  response  regulator  (HK/RR)  pair  (DET 1560, 1561)  all  are  linked  in  a 
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single  network.  This  supports  both  the  predicted  operon  structure  of  this  reductive  dehalogenase 
as  well  as  the  relationship  the  RDase  has  with  the  histidine-kinase  receptor.  However,  histidine- 
kinase  receptors  regulate  the  expression  of  the  gene  transcript  by  their  phosphoralization  state 


and  therefore  the  linking  of  the  expression  values  must  be  analyzed  with  caution. 


3.  Are  individual  RDases  co-regulated  with  other  elements  of  the 
proposed  electron  transport  chain  (e.g  Hup)? 

Surprisingly,  the  only  RDases  that  clusters  with  Hup  is  one  of  the  “minor”  RDases  -  DET1545. 
Interestingly,  though,  of  all  the  genes  in  the  genome,  Hup’s  strongest  association  was  with  a 
putative  NADH/ubiquinone  oxidoreductase  (“Nuo”)  (edge  score  between  Hup  &  Nuo  =  0.849). 
This  has  led  to  the  hypothesis  that  electrons  stripped  from  H2  are  first  passed  to  Nuo  as  the  first 
step  in  the  electron  transport  chain  in  these  organisms.  Biochemical  assays  with  purified 
enzymes  would  be  needed  to  confirm  this  proposed  interaction. 

4.  Which  gene  networks  correlate  with  the  presence  of  other  community  members? 

In  addition  to  the  DET  genes,  the  microarray  also  had  probes  for  other  community  members’  16S 

rRNA  genes.  Although  the  vast  majority  of  edges  in  the  REFS  Gene  Network  model,  there  were 
several  cases  of  DET  genes’  interacting  strongly  with  other  community  members  16S  rRNA 
intensity.  These  include  several  Gram  positive  organisms.  The  Gram  positives  are  now  thought 
to  be  the  major  butyrate  fermenters  in  the  culture  and  Dehalococcoides  is  dependent  upon  them 
for  the  H2  gas  utilized  as  the  sole  electron  donor. 

5.  Which  biomarkers  are  indicative  of  DHC  stress? 

Cultures  were  exposed  to  the  following  stresses  in  these  studies:  PCE  starvation,  PCE  solvent 
effect,  and  02  stress.  A  list  of  the  Clusters  of  Orthologous  Genes  (COGs)  categories  in  D. 
ethenogenes  was  obtained  using  the  Joint  Genome  Institute  (http://img.igi.doe.gov).  The  stress 
response  genes  were  predicted  to  be  annotated  with  the  COGs  of  “posttranslational  modification, 
protein  turnover,  chaperones”  and  “replication,  recombination  and  repair”.  A  total  of  141 
potential  stress  response  genes  were  identified  in  the  genome,  with  57  from  the  “posttranslational 
modification,  protein  turnover,  chaperones”  COG  and  84  from  the  “replication,  recombination 
and  repair”  COG.  This  number  is  corrected  for  the  gene  products  which  have  been  annotated 
twice  with  different  locus  tags.  Of  these  many  were  expressed  highly  at  the  RNA  level. 
However,  most  were  highly  expressed  in  control  cultures  that  were  actively  and  happily 
respiring.  Using  a  combination  of  the  microarray  and  shotgun  proteomics  data,  a  surpising  gene 
was  identified  as  specifically  responding  to  Oxygen  stress:  DET  1 132,  which  codes  for  a  protein 
involved  in  glutamine  biosynthesis.  Why  this  core  metabolic  protein  would  be  beneficial  to 
handling  the  oxidative  stress  is  as  yet  unclear. 


